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ABSTRACT 

Comparative tests were performed on seven 
signature extension algorithms to evaluate their 
effectiveness in correcting for changes in atmos- 
pheric haze and Sun angle in a Landsat scene. Four 
of the algorithms were cluster matching, and two 
were meiximum likelihood algorithms. The seventh 
algorithm determined the haze level in both training 
and recognition segments and used a set of tables 
calculated from an atmospheric model to determine 
the affine transformation that corrects the training 
signatures for changes in Sun angle and haze level. 
Three of the algorithms were tested on a simulated 
data set, and all of the algorithms were tested on 
consecutive-day data. The classification performance 
on the data sets using the algorithms is presented, 
along with results of statistical tests on the accu- 
racy and proportion estimates. The three algorithms 
tested on the simulated data produced significant 
improvements over the results obtained using untrans- 
formed signatures. For the consecutive-day data, 
the tested algorithms produced improvements in most 
but not all cases. The tests indicated also that no 
statistically significant differences were noted 
eunong the algorithms. 


1. INTRODUCTION 

Signature extension is the process of using signatures from a given segment"*" 

(the training segment, or T-SEG) to classify another segment (the recognition segment, 
or R-SEG) . If such a procedure gave classification accuracies and proportions that 
were comparable to those obtained with local training, it would save much time and 
effort in a project such as the Large Area Crop Inventory Experiment (LACIE) , where 
many segments must be classified. The simplest approach is to use the untransformed 
(UT) signatures from the T-SEG. Generally, this does not work well because these 
signatures are different from the signatures in the R-SEG as a result of differences 
in haze level. Sun angle, euid various factors which affect target reflectance, such 
as soil color and growth stage. 

In this study, comparative tests were performed on seven signature extension 
algorithms to evaluate their effectiveness in correcting for changes in atmospheric 
haze and Sun angle in a Landsat scene. The evaluation criteria were classification 
accuracy and proportion estimation. The algorithms tested were the Maximum Likeli- 
hood Estimation of Signature Transformation (MLEST) , the University of Houston Maxi- 
mum Likelihood Estimator (UHMLE) , the Optimal Signature Correction Algorithmic 
Routine (OSCAR) , modified OSCAR (MOD OSCAR) , the Rank Order Optimal Signature Trans- 
formation Estimation Routine (ROOSTER) , modified ROOSTER (MOD R) , and the Atmospheric 
Correction (ATCOR) program. 

‘Under National Aeronautics and Space Administration Contract NAS 9-15200 at the 
Lyndon B. Johnson Space Center, Houston, Texas. 

"*"A segment in this paper is a 9-by-ll-kilometer ground area. 
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2. THE DATA SETS 


Two data sets were used — one consisting of simulated data (section 2.1) and 
the other a set of acquisitions on consecutive days (section 2.2). The simulated 
data provided for a controllec experiment in which the transformations were known 
and in which the problems of nonnormal distributions and nonrepresentative statis- 
tics were avoided. The consecutive-day data set provided for a test of the capa- 
bility of the algorithms to correct for atmospheric effects when effects caused by 
differences in the training and recognition targets are eliminated. The algorithms 
ROOSTER, UHMLE, and MLEST were tested on the simulated data. All the algorithms 
were tested on the consecutive-day data set. 

2.1 SIMULATED DATA 

The 1975 data base of the Earth Resources Interactive Processing System (ERIPS) 
contains four passes of four-channel simulated data for each of segments 429 and 432. 
Each segment has 117 lines and each line 196 pixels. The field coordinates reside in 
the ERIPS field data base. Four classes exist within each segment: wheat (W) , 

barley (B) , stubble (S) , and grass (G) . Each class is divided into two subclasses. 

The data were generated from means and covariance matrices determined from train- 
ing fields in Hill County, Montana. An algorithm was used to generate multivariate 
normal data with the same statistics. This was done separately for the four passes 
of segment 429. Each pass of segment 432 was created from the distributions used 
in the corresponding pass of segment 429 by transforming them with an affine trans- 
formation so that the data corresponded to a different Sun angle. Segment 429 was 
chosen to be the T-SEG and segment 432 the R-SEG. All classifications were made in 
four channels. Four data sets correspond to the four passes: SIMl, SIM2, SIM3, 

and SIM4. 

2 . 2 CONSECUTIVE-DAY DATA 

Seven sets of consecutive-day passes of Landsat-1 data from intensive test 
sites in Ellis, Finney, and Saline Counties, Kansas, were tested. The first set is 
denoted F1709-8. (The F indicates Finney County; 1709-8 indicates the dates of the 
training and recognition passes, respectively; i.e., the training pass was made 
1709 days and the recognition pass 1708 days after the launch of Landsat-1.) In 
all, four sets from the Finney, two from the Saline, and one from the Ellis County 
test sites were used. 

Ground truth was available for all fields in all test sites. A subset was 
selected for training fields, and fields were grouped into subclasses with the aid 
of cluster maps. In general, the rectangular ground- truth areas were not oriented 
so that their sides were parallel to the scan lines in the Landsat-1 data. To 
facilitate the application of the various algorithms, a "signature extension area" 
was defined (the smallest rectangular area with sides parallel to the Landsat scan 
lines) that included the ground- truth area in each case. For Finney County, this 
included the entire 9- by 11-kilometer segment (117 lines, 196 pixels) containing 
the ground-truth area. For Saline County, it included lines 26 to 91 and pixels 27 
to 146; for Ellis, it included lines 24 to 109 and pixels 49 to 144. 

3 . APPROACH 

The overall approach was to make signature extension runs using these algorithms 
and to compare the results with local classification results or ground truth. The 
algorithms were to provide modified training statistics which then were used to clas- 
sify the recognition area. The UHMLE computes these modified statistics directly; 
all the other algorithms compute an affine transformation which is then used to 
modify the training statistics. 

3 . 1 THE ALGORITHMS 

The descriptions given here provide only a very rough idea of how these algo- 
rithms work. References are given to more detailed discussions. In the case of the 
consecutive-day data, the algorithms were usually run using the data from the signa- 
ture extension area defined above. Exceptions will be noted. 

3.1.1 MLEST . The MLEST technique [1] uses an iterative gradient optimization 
procedure (the Davidon-Fletcher-Powell algorithm) to obtain maximum likelihood esti- 
mates for the affine transformation assumed to relate the training and recognition 
statistics. The training subclass a priori probabilities and statistics are input 
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to the program, which outputs the maximum likelihood estimate of the affine 
transformation . 

3.1.2 UHMLE. The UHMLE [2] takes sublcass statistics from a T-SEG and image 
data from an R-SEG and computes maximum likelihood estimates of subclass proportions 
and statistics for the R-SEG. Two versions of UHMLE were used. The first, UH all, 
uses the ground-truth area as input data; when this version is used to obtain maxi- 
mum likelihood estimates of proportions generated internally by UHMLE, it is referred 
to as UH all MLE. The second, UH fields, uses only the training fields within the 
R-SEG; when this version is used to obtain maximum likelihood estimates of propor- 
tions generated internally by UHMLE, it is referred to as UH fields MLE. The second 
version was introduced to eliminate the effect of insufficient training. The 
statistics generated by UHMLE are used to classify the ground-truth area in the 
R-SEG. 

3.1.3 OSCAR. The OSCAR [3] considers every possible transformation defined by 

four cluster means: two in the T-SEG and two in the R-SEG. From these transforma- 

tions, the algorithm selects those that are "best" able to match the training 
clusters with the recognition clusters. The aunount of computation is kept to a 
manageable level (1) by rejecting pairings judged to be unreasonable on the basis 

of rankings and (2) by testing the remaining transformations, using each to trans- 
form all the training clusters, and calculating a measure based on the distance of 
the transformed training clusters from the recognition clusters. The five transfor- 
mations giving the "best" measure are then averaged. 

3.1.4 MODIFIED OSCAR. The MOD OSCAR [4], in effect, defines a transformation 
for each pair of clusters — one in the R-SEG and one in the T-SEG. Each cluster is 
used with its projection onto the soil line^ to define a transformation. The 
transformations are evaluated as in OSCAR, and the best transformation is output. 

3.1.5 ROOSTER. To perform signature extension with ROOSTER [5], one first 
obtains a set of class means for the T- and R-SEG' s. These class means, called mean 
vectors, are obtained by clustering or by deriving class statistics from training 
fields . 

The first step is to derive rank vectors corresponding to each of the mean vec- 
tors. These rank vectors are obtained by computing for each channel the rank of each 
mean relative to the others for that segment. The rank vectors are used to match 
the classes (or clusters) in the training area with those in the recognition area. 
Then, a regression analysis is used to determine the affine transformation which best 
transforms the mean vectors from the training area into the corresponding mean vec- 
tors from the recognition area. 

In this study, the ROOSTER was used in three different ways; The first, R(C) , 
consisted of using clusters to define the class means for both segments; the second, 
R(S) , used subclass means derived from training fields for both segments (It is 
expected to provide cm estimate of how well ROOSTER would do if an ideal clustering 
algorithm were available.); and the third, R(S/C), used subclass statistics for the 
T-SEG and clusters for the R-SEG. This is an alternate way of using ROOSTER opera- 
tionally, since subclass statistics are always available for the training area. 

3.1.6 MODIFIED ROOSTER. The MOD R [4] is identical to ROOSTER except that 
the regression line is computed with the cluster means and the projections of the 
cluster means onto the soil line. 

3.1.7 ATCOR. The ATCOR program [6] is designed to correct for differences in 
haze level and Sun angle between the training emd recognition data sets. The pro- 
gram processes each of these data sets separately. In each case, the input is the 
Landsat-1 data and the solar zenith angle. The ATCOR program determines the haze 
level from the brightness of certain dark targets in the scene and uses an atmos- 
pheric model to calculate a set of coefficients relating the Landsat data for that 
scene to the reflectance of the targets on the ground. The coefficients obtained 
from the training and recognition data sets are then used to compute the affine 
transformation to be applied to the training data. 


The soil line is the "bottom of the tasselled cap" or that part of channel 
space containing bare soil. 


3.1.8 REGRES. Rather them a signature extension algorithm, the REGRES program 
is a method for finding the optimiim affine transformation to be applied to the sta- 
tistics of the consecutive-day data. In each channel, a scatter plot is made of the 
second-day data versus the first-day data. A straight line is then fitted to the 
data which minimizes, in the least squares sense, the perpendicular distance from 
the points to the line. In principle, this line represents the best affine trans- 
formation for the training statistics. 

3.2 CLASSIFICATION AND EVALUATION 

After obtaining the modified statistics, we used standard LACIE classification 
procedures and the Laboratory for Applications of Remote Sensing System (LARSYS) 
implemented on the Univac 1108 computer to classify the R-SEG's. A two-class clas- 
sifier was used with equal a priori probabilities for wheat and nonwheat. Within 
each class, the subclasses had equal a priori probabilities. A 1-percent chi- 
squared threshold was used. For the simulated data, entire areas were classified; 
for the consecutive-day data, the ground-truth areas were classified. 

3.2.1 C^SSIFICATION ACCURACY. The classification accuracy was determined for 
wheat and nohwheat by using the training fields defined in section 2 as test fields. 
From these, the overall accuracy was computed. This is given by: 

Overall accuracy = q^ (w/w) + q^j^p (i(!/<)>) (1) 

where p(w/w) = wheat accuracy, p( $/<(>) = nonwheat accuracy, q^, = wheat proportion in 
ground- truth area, and q^ = nonwheat proportion in ground- truth area. The propor- 
tions q^ and q,j| were known from ground truth. The wheat, nonwheat, and overall 
accuracies were compared with the results obtained from local classification 
(section 4) . 

3.2.2 WHEAT PROPORTIONS. The classification results yielded wheat proportions 
for the ground-truth areas defined in section 2. In addition, the UHMLE progr^un 
yielded a maximum likelihood estimate of the wheat proportions. These results were 
compared with the ground-truth proportions and the results obtained from local 
classification. 


4 . RESULTS 

The results of this processing are given in tables 1 to 11. Table 1 gives the 
A and B coefficients determined for the consecutive-day data by those algorithms 
which produce an affine transformation. The algorithms are listed in the order in 
which they performed in the accuracy test. 

Based on numerical calculations using an atmospheric model [6], certain con- 
straints are expected to apply to the A and B coefficients corresponding to a change 
in the haze level. These should apply to the consecutive-day data if the haze 
levels present are uniform. Among these constraints, which apply to all channels, 
are the following : 

1. If there is no difference in haze level between the T-SEG and the R-SEG, A ° 1.0 
and B = 0.0. 

2. If the T-SEG has more haze than the R-SEG, A > 1.0 and B < 0.0. 

3. If the T-SEG has less haze thcui the R-SEG, A < 1.0 and B > 0.0. 

In many cases, the data in table 1 do not obey these rules. Examples can be 
found in the following anomalies: 

1. A > 1.0 for some channels and A < 1.0 for others; e.g., R(S) for F1655-4. 

2. A > 1.0 and B > 0.0; e.g., MLEST for F1673-2. 

3. A < 1.0 and B < 0.0; e.g., R(C) for F1726-7, channel 2. 

These failures to obey the constraints probably are due in part to nonuniform haze 
levels in the data and to changes in the look angle. 
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Tables 2 , 3 , 4 , and 5 give the accuracy results for wheat and nonwheat using 
both data sets. The accuracy obtained with signature extension is expressed as a 
percentage difference from the local result; i.e.. 


Percentage difference = 


signature extension accuracy - local accuracy 
’’ local accuracy 


X 100% 


( 2 ) 


Tables 6 and 7 give similar results for overall accuracy. 

Tables 8 to 11 give the differences for both data sets (1) between results 
obtained using signature extension and local classification and (2) between results 
obtained using signature extension and ground truth. The means and standard devia- 
tions were obtained using the absolute values of the numbers in the tables. 


5. ANALYSIS 


In this section, a statistical analysis is performed on the data in tables 6 
and 10. Data for the UHMLE algorithm were not included because of their large 
variances. 


First, an analysis of variance was performed on the data in table 6. The pur- 
pose of an analysis of variance is to separate a response variable into component 
parts. In this way, the test for a particular factor will become more sensitive 
because variations due to other causes have been removed. In this experiment, two 
factors were present: signature extension algorithms and the seven consecutive-day 

acquisitions. The second factor could have been grouped several different ways, on 
the bases of days, sites, and presence of haze. 

The last alternative was chosen. Each pass was classified as either clear or 
hazy by visually inspecting the images of the data produced by ERIPS. The results 
are shown in table 12. Three T-SEG-R-SEG combinations occurred; namely, haze-clear, 
clear-haze, and clear-clear. It was assumed that each combination would produce 
different results (classifications) , thus the need for this factor in the analysis. 

The interaction between the algorithms and haze combinations (AxH) was also 
expected to be present; that is, one algorithm might have performed well for the 
clear-haze consecutive-day acquisitions and poorly for the haze-clear days, whereas 
the opposite results might have occurred for another algorithm. 

The model for the experiment was 


'ijk 


+ a. + hj + ah. . + 


(3) 


where y = overall mean, a^ = contribution of the Lth algorithm, hj = contribution 
of the jth haze level, ah^j = contribution of algorithm i and haze level j to the 
interaction, eij)^ = error term for the observation for the ith algorithm and 
the jt)i haze level, and yi.i)j = response variable. In the analysis of variance for 
overall accuracy, yijk = percentage accuracy difference; that is, the quantity 
given in table 6 . 


The results of this analysis of variance are given in table 13, where signifi- 
cant differences between the algorithms and between the haze conditions are apparent. 


Table 14 gives the average accuracy difference over the algorithms for each haze 
condition. Because the analysis of variance indicated significant differences 
between haze conditions, we can infer from table 14 that the presence of haze over 
the T-SEG is significeuitly different from the other two conditions. 

The results for the different haze conditions were plotted as a function of the 
algorithms (fig. 1) . The condition with haze over the T-SEG shows consistently 
better results than the other two conditions. A similar analysis was performed for 
the wheat proportions. In this case, yij)c was the quantity given in table 10. 

R(S/C) was not included because of its large variance. The results show a signifi- 
cant difference between the haze conditions but not between the algorithms (table 15) 
Table 16 gives the average proportion difference over the algorithms for each haze 
condition, and figure 2 shows the performance of each algorithm for each of the three 
haze conditions. Here again, the haze-clear condition seems to give the best results 
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6 . CONCLUSIONS 


The results of these tests are summarized in table 17. The first two columns 
list the algorithms in the order in which they performed on the accuracy test for 
the simulated and consecutive-day data. The numbers given are the mean percentage 
differences between the accuracy obtained using the algorithms and local accuracy 
(see tables 6 and 7) . The minus signs indicate that the algorithm was less accurate 
than local classification. A statistical analysis was performed on the accuracy 
results for the consecutive-day data with the exception of data for the three ver- 
sions of UHMLE (which were omitted because of large variances) . The analysis indi- 
cated (1) no significant differences among the algorithms and (2) that the results 
obtained when the T-SEG appeared hazier than the R-SEG were better than in the other 
two conditions observed; i.e., when both were clear or the R-SEG was hazier. 

The comparison of wheat proportion differences (between ground truth and local 
results) in the last four colvimns of table 17 shows the performance order of the 
algorithms to be the scune for simulated data but quite different for consecutive-day 
data. This was because local results were quite different from ground-truth results 
for the consecutive-day data. These four columns of numbers are the means of the 
absolute values of the differences as given in tables 8, 9, 10, and 11. A statisti- 
cal analysis was performed on the consecutive-day data for wheat proportion differ- 
ences from local results. Data from R(S/C) and the three versions of UHMLE were not 

used because of large variances. The results given in table 15 Indicate no signifi- 
cant differences £unong the algorithms tested. Here again, the best results were 

obtained when the T-SEG appeared hazier than the R-SEG. 

Finally, it must be mentioned that, because of time limitations, this test was 
performed using the currently available algorithms. Subsequently, it has been dis- 
covered that some of the algorithms show better performance when later versions are 
used. For exeunple, the progr^un UHMLE has a later version that begins with the trans- 
formation (x + b) before estimating the R-SEG statistics. However, the results pre- 
sented in this paper provide illustrative information which C 2 ui be used in solving 
the signature extension problem. 
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0.0 

0.99 

3.0 

0.97 

3.7 

0.98 

3.1 

0.99 

2.9 

1.03 

- 0.6 

0.94 

3.6 

1.13 

Bfl 


1.08 

Bl 

1.02 

0.0 

1.01 

1.0 

1.01 

1.0 

1.01 

1.2 

1.01 

1.1 

1.02 

0.5 

0.95 

1.4 

1.15 

B 


TABLE 2. WHEAT ACCURACY FOR TABLE 3. NONWHEAT ACCURACY FOR 

SIMULATED DATA* SIMULATED DATA* 


Data 

Local 

Percentage difference between 
local accuracy and that obtained 
with various algorlthu 

■ 

Data 

Local 

Percentage difference between 
local accuracy and that obtained 
with various algorithms 

R(S) 

MLEST 

UH 

fields 

R(C) 

1 

UT 

R(S) 

MLEST 

UH 

fields 

R(C) 

UT 

SIHl 

84.4 

-2.0 

6.3 

-100 

-28.4 

-100 


SD41 

96.4 

0.5 

-6.3 

-0.2 

-30.0 

-99.1 

SIM2 

97.1 

0.0 

1.2 

-2.9 

0.0 

-26.3 


SIM2 

99.1 

0.0 

-0.5 

-0.1 

0.0 

-15.8 

S1M3 

94.8 

0.2 

3.5 

2.8 

-12.8 

-84.4 


SIM3 

97.7 

0.0 

-1.1 

-2.3 

-2.9 

-39.5 

SIM4 

87.9 

-0.1 

6.5 

-13.5 

-1.7 

-28.0 


SOI4 

94.3 

-0.1 

-6.0 

-2.4 

-3.2 

-3.2 

Mean 

91.1 

-0.5 

4.4 

-28.4 

-10.7 

-59.7 


Mean 

96.9 

0.1 

-3.5 

-1.3 

-9.0 

-39.4 

Std. dev. 

5.9 

1.0 

2.5 

48.2 

13.1 

38.1 


Std. dev. 

2.0 

0.3 

3.1 

1.3 

14.1 

42.5 


TABLE 4. WHEAT ACCURACY FOR CONSECUTIVE-DAY DATA* 




Percentage difference between local 

accuracy and 

that obtained with various algorithsis 

Data 

Local 

RtS) 

MLEST 

OSCAR 

REGRES 

MOO R 

R(C) 

MOO 

OSCAR 

ATCOR 

UH 

fields 

UT 

R(S/C) 

UH all 

F1709-8 

96.7 

0.5 

1.1 

0.7 

1.2 


1.1 

1.3 

1.2 


1.1 


-21.9 

F1673-2 

97.3 

-0.8 

0.9 

-3.9 

-6.6 

Bfl 

-3.8 

-2.9 

-2.9 

mm 

-0.6 

B9 


F1655-4 

93.5 

-15.5 

2.7 

-12.8 

-11.1 

-12.7 

-17.8 

-14.5 

-19.3 

BB 

-17.8 

-1.8 


F1726-7 

82.6 

-3.3 

-6.7 

-2.3 

-3.0 

-5.9 

-10.9 

-1.9 

-1.1 

-27.4 

-2.4 

-8.4 

Bfl 

S1455-4 

92.0 

1.0 

1.6 

-4.0 

-2.7 

-7.1 

-11.7 

-6.1 

-0.7 

-16.8 

0.0 


-42.6 

S1725-4 

79.7 

7.3 

2.0 

4.6 

7.0 

-0.9 

B« 

1.8 

-0.6 

1.8 

-20.2 


21.3 

E1726-5 

92.6 

-1.9 

1.1 

-0.3 

-2.3 

-1.5 

Bfl 

-0.4 

-6.2 

-50.9 

-1.0 

B 

-49.2 

Mean 

90.6 

0.1 

0.4 

-2.6 

-2.5 

-4.3 

-4.8 

-3.2 

-4.2 

-17.5 

-6.8 

Bi 

-21.5 

Std. dev. 

6.8 

3.5 

3.2 

5.4 

5.7 

4.6 

9.1 

5.6 

7.0 

17.5 

9.5 

8.3 

27.2 


*A minus sign means the algorithm was less accurate than local classification. 
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TABLE 5. NONWHEAT ACCURACY FOR CONSECUTIVE-DAY DATA* 


D«ta 

Local 

Percenta 9 e difference between local accuracy and 

that obtained 

with varioua algorlthma 

R(S) 

MLEST 

1 

OSCAR 

REGRES 

MOD R 

R(C) 

MOD 

OSCAR 

ATCOR 

UH 

flelda 

UT 

R{S/C) 

UR all 

P1709-8 

73.9 

-8.5 

-6.8 

-10.3 

-10.6 

-11.2 

-12.0 

-11.5 

-12.4 

10.7 

-12.0 

-18.7 

19.9 

F1673-2 

95.7 

-2.3 

-1.0 


-11.5 

1.6 

-0.9 

0.1 

-5.7 

-27.2 

0.3 

-2.3 

-30.8 

P1655-4 

95.4 

0.4 

-3.4 

mm 

0.7 

0.4 

-0.5 

0.6 

1.4 


0.6 

-4.4 

-4.0 

P1726-7 

79.1 

3.7 

3.9 

5.8 

7.6 

-0.4 

2.3 

3.9 

-7.5 


-10.5 

-6.6 

-6.8 

SX455-4 

78.9 

-2.3 

-S.l 

-2.8 

-0.5 

3.0 

7.6 

3.0 

1.3 


mm 

-5.8 

-8.0 

S1725-4 

93.5 

-6.5 

-3.5 

-7.7 

-8.4 

-6.1 

-14.7 

-5.7 

-9.7 

-11.8 


-8.1 

-23.5 

E1726-S 

45.2 

-5.1 

-17.3 

-9.3 

-5.3 

-2.2 

-11.1 

-25.2 

3.1 

86.1 

-28.5 

-31.4 

61.3 

Mean 

80.2 

-2.9 

-2.2 

-3.7 

-4.0 

-2.1 

-4.2 

-5.0 

-4.2 

9.0 

-9,5 

-11.0 

1.2 

Std. dev. 

17.9 

4.2 

12.4 

5.9 

6.9 

5.0 

8.4 

10.4 

6.1 

36.5 

10.7 

10.4 

31.0 


TABLE 6. OVERALL ACCURACY FOR CONSECUTIVE-DAY DATA* 


Data 

Local 

Percentage difference between local accuracy and that obtained 
with varioua algorlthaM 

R(S) 

MLEST 

OSCAR 

REGRES 

MOD R 

R(C) 

MOO 

OSCAR 

ATCOR 

UH 

flelda 

nr 

R(S/C) 

UH all 

P1709-8 

79.5 

-5.8 

Bn 

Bn 

■a 

-7.6 

-8.1 

-7.8 

-8.5 

2.7 

-8.2 


7.3 

P1673-2 

96.1 

-2.0 


ig 


0.5 

-1.7 


-5.0 

-21.3 

0.1 

B9 

-23.7 

P1655-4 

94.9 

-3.3 

-1.8 

Bn 

mm 

-2.7 

-4.7 


-3.6 

-3.1 

-3.8 

-3.8 


P1726-7 

80.0 

1.9 


3.8 

BBl 

-1.9 

-1.1 


-5.9 

0.9 

-8.5 

■n 

■fi 

S1455-4 

86.5 

-0.2 

-0.9 

Bn 

-1.8 

-3.2 

-4.4 


0.1 

-12.1 

0.0 

Bn 

-29.5 

S1725-4 

85.4 

1.1 

-0.5 

mm 

mgm 

-3.2 

-1 .9 


-4.7 

-4.3 

-14.1 

-11.0 


E1726-S 

66.2 

-3.2 

-6.0 

-3.8 

m 

-1.8 

-4.1 

-9.8 

-2.7 

1.4 

-11.5 

-9.8 


Mean 

84.1 

-1.6 

-1.8 

Bn 

-2.8 

-2.8 

-3.7 

-3.8 

-4.3 

-5.1 

-6.6 

Bn 


Std. dev. 

10.2 

2.7 

2.6 

m 

4.9 

2.5 

2.4 

4.2 

2.7 

8.7 

5.5 

In 



TABLE 7. OVERALL ACCURACY FOR 
SIMULATED DATA* 


Data 

Local 

Percentage difference between 
local accuracy and that obtained 
with varioua algorlthaw 

R(S) 

MLEST 

UB 

flelda 

R(C) 

UT 

SINl 

93.5 

0.0 

-3.5 

-21.7 

-29.6 

-99.3 

SIN2 

98.6 

0.0 

0.0 



-18.3 

SIM3 

97.0 

0.1 

0.0 



-50.0 

SIM4 

92.8 

-0.1 

-3.2 


^B 

-8.8 


95.5 

0.0 

-1.7 



-44.1 


2.8 

0.1 

1.9 


13.6 

40.8 


TABLE 8. WHEAT PROPORTIONS FOR SIMULATED 
DATA AS DETERMINED USING LOCAL RESULTS 




Signature extension proportion sinus 
local proportion 

Data 

Local 

R(S) 

R(C) 

MLEST 

DB 

fields 

KLE 

UH 

fields 

UT 

SINl 

24.3 

-0.5 

-1.9 

0.6 

•16.6 

-22.3 

-24.3 

SIN2 

24.7 

0.0 

0.0 

0.6 

-1.0 

-1.1 


SIK3 

34.9 

0.1 

-3.1 

1.5 

1.7 

1.6 


SIM4 

34.2 

0.0 

-0.2 

6.8 

-0.3 

-0.6 

D 

Mean abaolute 

values 


0.2 

1.3 

2.4 

5.0 

6.4 

13.5 

Std. dev. 


0.2 

1.5 

3.0 

7,9 

10.6 

10.3 


TABLE 9. WHEAT PROPORTIONS FOR SIMULATED DATA AS DETERMINED USING GROUND TRUTH 



Ground 

truth 

Signature ext«islon proportion adnua 
ground-truth proportion 

Data 

R(6) 

1 

R(C) 

1 

MLEST 

OB 

fields 

KLE 

UB 

fields 

or 

SlMl 

23.9 

-0.1 

-1.5 

1.0 

-16.4 

-21.9 

-23.9 

SDt2 

23.9 

0.8 

0.8 

1.4 

-0.2 

-0.3 

-2.4 

SZH3 

23.9 

1.1 

-2.1 

2.5 

2.7 

2.6 

-19.0 

SIN4 

23.9 

0.3 

0.1 

7.1 

-0.3 

-0.3 

-6.0 

Mean abaolute 

values 


0.8 

1.1 

3.0 

4.9 

6.3 

12.8 

Std. dev. 


0.4 

0.9 

2.8 

7.8 

10.5 

10.3 


*A minus sign mecins the algorithir was less accurate than local classification. 
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TABLE 10. WHEAT PROPORTIONS FOR CONSECUTIVE-DAY DATA 
AS DETERMINED USING LOCAL RESULTS 


Acquiaition 

Local 



Signature axtanaion proportion minua local 

proportion 




REGRES 

OSCAR 

NOD R 

or 

NI.EST 

ATCOR 

MOD 

OSCAR 

R(S/C) 

R(C) 

QQ 

UH 

fielda 

OH all 
MLE 

P1709-8 

35.4 

5.9 

7.5 

9.0 

m 

9.8 

7.8 

10.7 

10.3 



BB 

B M 

-5.2 

ri673>2 

21.9 

3.1 

-2.0 

o.< 

fm 

1.9 

m 

-1.6 

-1.9 

m 

■g 


B 1 

22.5 

P165S-4 

27.7 

-2.2 

-2.2 

-2.1 

ig 

-4.7 

D 

-4.9 

-2.8 

2.8 

-3.8 

mm 


5.8 

P1726-7 

28.0 

-1.0 

-2.0 

-1.4 


3.1 

IM 

2.6 

-1.4 

1.9 

-2.6 


B 1 

-2.8 

S14SS-4 

53.7 

0.5 

-3.3 

-3.5 

-8.6 

-2.0 

4.8 

-3.0 

-7.6 

-0.3 

-13.3 

-11.4 

B 1 

-8.3 

S1725>4 

35.3 

4.< 

5.0 

5.3 

IB 

-0.8 

m 

4.4 

3.6 

-0.9 

9.6 

19.6 

B~9 

20.7 

E1724-5 

<1.9 

1.8 

1.4 

3.3 


<.< 

B 

-3.2 

6.0 

9.8 

3.6 

-25.1 

-36.1 

-29.7 

M«an Abaoluta 

valuaa 


m 

3.3 

3.6 

3.8 

4.1 

m 

4.3 

4.8 

5.2 

6.4 

12.8 

13.2 

13.6 

Std. dav. 


D 

2.2 

2.9 

3.< 

L±i 

B 

3.0 

3.3 

5.9 

4.6 

9.2 

10.7 

10.6 


TABLE 11. WHEAT PROPORTIONS FOR CONSECUTIVE-DAY DATA 
AS DETERMINED USING GROUND TRUTH 



Ground 

truth 

Signature axtanaion proportion ainua ground-truth proportion 

Data 

R<8) 

REGRES 

OSCAR 

1 

MOD R 

UT 

MLEST 

ATCOR 

MOD 

OSCAR 

R(S/C) 

R(C) 

UR all 

UH 

fialda 

UH all 
MLE 

P1709-8 

24.6 


18.3 



K ' 


21.5 


27.3 

21.1 

1.9 

2.3 

5.6 

ri673-2 

24.6 


2.3 

wm 

wm 

B: 

mm 

2.7 

wm 

8.7 

2.8 

25.0 

17.6 

26.8 

F1655-4 

24.6 


0.9 

n 

B 

B ! 

mm 

-1.8 

B 

msM 

-0.7 

4.7 

mam 

8.9 

fl726-7 

24.6 

BB 

2.2 

2.8 

mm 

B: 

3.8 

6.8 

n 

maa 

1.6 

1.8 

wsm 

1.4 

S1455-4 

58.3 


-7.9 


-13.2 

B ! 


-7.6 

-12.2 

BB 

-17.9 

-16.0 

-15.2 

-12.9 

S1725-4 

58.3 

-18^4 

-18.0 

-17.7 

-20.1 

-23.8 

-20.4 

-18.7 

-19.4 

-23.9 

-13.4 

-3.4 

-17.5 

-2.3 

E1726-5 

44.2 

19.5 

19.1 

21.0 

18.6 

24.3 

22.6 

14.5 

23.7 

27.5 

21.3 

-7.4 

-18.4 

-12.0 

Naan abaoluta 

valuaa 


10.0 

9.8 

10.8 

11.2 

12.9 

11.7 

10.5 

11.7 

14.9 

11.3 

8.6 

12.6 

10.0 

Std. dav. 


7.9 

8.4 

8.5 

8.8 

9.6 

8.8 

7.8 

9.9 

10.7 

9.3 

8.7 

6.2 

8.7 


TABLE 12. HAZE CONDITIONS ON 
CONSECUTIVE-DAY DATA AS 
DETERMINED BY INSPEC- 
TION OF IMAGES 


TABLE 13. ANALYSIS OF VARIANCE FOR 
OVERALL ACCURACY 


Source 

Degreea of 
f reedoa 

Sum of 

squares 

'^an 

square 

P- 

f actor 

Significance 

Algorithn (A) 

9 

217.61 

24.18 

2.02 

6% or 7% 

Rase (H) 

2 

113.69 

56.85 

4.74 

St 

AxR 

18 

205.32 

11.41 

.95 

NS 

Error 

40 

480.17 

12.00 



Total 

69 

1 016.79 





Data 

T-8BG 

R-SBG 

P1709-8 

Clear 

Clear 

P1673-2 

Bate 

Clear 

P1655-4 

Clear 

Clear 

F1726-7 

Base 

Clear 

81455-4 

Clear 

Clear 

S1725-4 

Clear 

Base 

E1726-5 

Clear 

Base 


TABLE 14. ACCURACY PERCENTAGE FOR 
THE THREE DIFFERENT HAZE 
CONDITIONS 


TABLE 15. ANALYSIS OF VARIANCE 
FOR WHEAT PROPORTIONS 


Source 

Degrees of 
freedom 

Sue of 

squares 

Mean 

square 

F- 

factor 

Significance 

Algorltha (A) 

8 

59.77 

7.47 

X 

NS 

Base (B) 

2 

156.27 

78.14 

9.42 

1% 

AxB 

16 

45.21 

2.83 

X 


Error 

36 

312.95 

8.69 



Total 

62 

574.20 



■B 


Base condition 

Percent accuracy 
difference 

T-SEG 

R-SEG 

Base 

Clear 

-1.71 

Clear 

Clear 

-4.26 

Clear 

Base 

-4.82 
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TABLE 16. AVERAGE PROPORTION DIFFERENCES 
FOR THE THREE HAZE CONDITIONS 


Haze condition 

Proportion difference 

T-SEG 

R-SEG 

Haze 

Clear 

2.0 

Clear 

Clear 

5.8 

Clear 

Haze 

3.9 


ORIGINAL 
OF POOR QUALITY 


TABLE 17. SUMMARY OF TEST RESULTS 


Percentage difference between 
local accuracy and that obtained 
with various algorithas 

tfheat proportions difference 
fro« local 

Wheat proportion difference from 
ground truth 

Simulated 

data 

Consecutive- 
day data 

Simulated 

data 

Consecutive- 
day data 

Simulated 

data 

Consecutive- 
day data 

R(S) 

■9 

R(S) 


R(8) 


R(S) 

2.7 

R(S) 

1 

0.8 

UH all 

8.6 

MLEST 

B9 

MLEST 


R(C) 


REGRES 

3.3 

R(C) 

1.1 

REGRES 

9.8 

xm fields 


OSCAR 


MLEST 


OSCAR 

3.6 

MLEST 

3.0 

UH all MLE 


R(C) 

mm 

REGRES 

- 2.8 

UH fields MLE 


MOO R 

3.8 

UH fields MLB 

4.9 

R(8> 


or 

- 44.1 

MOO R 

- 2.8 

OH fields 

Bi 

or 


UH fields 

6.3 

ATCOR 




R(C) 

- 3.7 

UT 

13.5 

MLEST 


or 

12.8 

OSCAR 




MOO OSCAR 

- 3.8 



ATCOR 




MOO R 

11.2 



ATCOR 

- 4.3 



MOD OSCAR 




R(C) 

11.3 



UB fields 

- 5.1 



R(S/C) 




MLEST 

fJWM 



UT 

- 6.6 



R(C) 

^2 



MOD OSCAR 




R(S/C) 

- 7.1 



UB all 

12.8 



UH fields 

^^9 



UB all 

- 10.6 



UB fields 

13.2 



UT 

12.9 







UB all MLE 

13.6 



R(S/C) 

14.9 



FIGURE 1. HAZE-BY-ALGORITHM INTERACTION. 
Overall accuracy difference is shown for 
the three haze conditions. 



FIGURE 2. HAZE-BY-ALGORITHM INTERACTION. 
Proportion differences are shown for the 
three haze conditions. 
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